General Disclaimer 


One or more of the Following Statements may affect this Document 


• This document has been reproduced from the best copy furnished by the 
organizational source. It is being released in the interest of making available as 
much information as possible. 


• This document may contain data, which exceeds the sheet parameters. It was 
furnished in this condition by the organizational source and is the best copy 
available. 


• This document may contain tone-on-tone or color graphs, charts and/or pictures, 
which have been reproduced in black and white. 


• This document is paginated as submitted by the original source. 


• Portions of this document are not fully legible due to the historical nature of some 
of the material. However, it is the best reproduction available from the original 
submission. 


Produced by the NASA Center for Aerospace Information (CASI) 



"I f 
\ 
i 



(Bice Univ*) 


e /<! /-? u 7/ 


\ 


- 



y 7 4 - 2^ 668 


Uncles 




INSTITUTE FOR COMPUTER SERVICES AND APPLICATIONS 

RICE UNIVERSITY 



275-025-012 


Error Detection & Data Smoothing 
Based on Local Procedures 

by 

Victor M. Guerra 
Dept, of Mathematical Sciences 
Rice University 


ABSTRACT 

This thesis presents an algorithm which is able to locate isolated bad points 
and correct them without contaminating the rest of the good data. This work 
has been greatly influenced and motivated by what is currently done in the 
manual loft. It is not within the scope of this work to handle small random 
errors characteristic of a noisy system, and it is therefore assumed that the 
bad points are isolated and relatively few when compared with the total 
number of points. 

Motivated by the desire to imitate the loftsman we conducted a visual experi- 
ment to determine what is considered smooth data by most people. This 
criterion is used to determine how much the data should be smoothed and to 
prove that our method produces such data. The method ultimately converges 
to a set of points that lies on the polynomial that interpolates the first and 
last points ; however convergence to such a set is definitely not the purpose 
of our algorithm. The proot of convergence is necessary to demonstrate 
that oscillation does not take place and that in a finite number of steps the 
method produces a set as smooth as desired. ' 

The amount of work for the method described here is of order n. The one 
dimensional and two dimensional cases are treated in detail ; the theory can 
be readily extended to higher dimensions. 
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FIGURE 15 
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FIGURE 16 
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K = I 0 «= 0.5 SMOOTHNESS *» 0,02 IT = 390 
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FIGURE 18 
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Example 7.5. In this example we apply the (k,h) -order smoothing 
algorithm to the set 

z(x,y) = exp(-(x-10) 2 - (y-20) 2 ) x,y=l,2 , . . , ,30. 

For k=h=2 this set is 0.0001-smooth. We introduce errors to 
90 points on the set. Figure 19 represents this set, the asterisk 
represents an error, each error is equal to 12,0 cm. . Table 3 
represents a list of the points that have errors and the sign of 
the error. Figure 20 represents the smoothed set. After 363 
iterations we get the desired smoothness of 0,02 cm. . The mean 
of the difference of the points in the smoothed set and the original 
set (equation (7.2)) is 0.0004 cm. . 
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CONTAMINATED DATA WITH 66 ERRORS 


OOOOOOOOOl l 11 
1 234567890123 

1 0000000000000 
2 0000000000000 

3 0000000*00000 

4 000*00*0*00*0 

5 000000**00000 

6 00000 * 00*0000 

7 0000*000000*1 

fl 0000000*01111 
9 000*000111111 

10 0000*1 1*11122 
11 0001111112222 

12 0011111222233 

13 01111122*33 3 4 

14 0111122233444 

15 1111222334445 

16 11 122233*4555 

17 111*2333445*6 

18 1122233*45566 

19 1122233445566 

20 1 1222 3 3*45667 

21 1122233445566 

22 1122233445566 

23 1112233**4556 

24 111*2233445*5 

25 111122*334445 

26 01111*2233444 

27 0111*12223*34 

28 0011111222233 

29 0001111112222 

30 0000011111122 


11 11 1 122222222223 

45678901234567890 

00000000000000000 
00000000000000000 
00000000**0000000 
0 * 0*0000000000000 
* 000 * 0000*0000000 
000 * 0000000*00000 
1111 * 1111*1111000 
111 * 11 * 111*111111 
12222 * 2222221*111 
22 * 222222222*221 1 
33333333333332222 
33444444444333322 
4444555554444*333 
555*55655*5554443 
*56666666*6555444 
66677777776*65554 
677778887*7766554 
777*88*888*776655 
7*888*88888776655 
77*8889*8887*7665 
77888888888776655 
77788888087776655 
677778*8777766554 
666777777**665554 
5566*6666665554. 4 4 
555*5565555*54443 
4*44*55*5 44444333 
33444444444333322 
33333333333332222 
2222222222222221 1 


FIGURE 19 
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SMOOTHED DATA 

K ■= 2 0 - 3/4 SMOOTHNESS » 0.02 IT = 363 


OOOOOOOOO 
1 23456789 

1 OOOOOOOOO 

2 OOOOOOOOO 

3 OOOOOOOOO 

4 OOOOOOOOO 

5 OOOOOOOOO 

6 OOOOOOOOO 

7 OOOOOOOOO 

8 OOOOOOOOO 

9 OOOOOOOll 
10 000001111 
11 000111111 

12 0 0 1 l l 1 l 2 a 

13 011111222 

14 0 1 1 1 1 2 2 2 3 

15 111122233 

15 111222334 

17 111223334 

18 112223344 

19 112223344 

20 112223344 

21 112223344 

22 112223344 

23 111223334 

24 111222334 

25 111122233 

26 011112223 

27 011111222 

28 001111122 

29 000111111 

30 000001111 


1111111111 

0123456789 

0000000000 

0000000000 

0000000000 

0000000000 

0000000000 

0000000000 

0 0 0 1 1 1 1 l 1 1 

1 1 1 l l 1 1 1 1 1 

1 1 1 1 1 2 2 2 2 2 

1122222222 
2222333333 
2233334444 
3334444455 
3444555555 
4445556666 
4555666777 
4556677770 
5566777808 
5566778886 
5667778888 
5566778888 
5566777888 
4556677778 
4555666777 
4445556666 
3444555555 
3334444453 
2233334444 
2222333333 
11 22222222 


22222222223 

01234567890 

00000000000 
00000000000 
00000000000 
ooooooooooo 
00000000000 
ooooooooooo 
111111110 0 0 
11111111111 
22222211 111 
2222222221 1 
33333332222 
44444333322 
55544444333 
65555554443 
66666555444 
77776665554 
887777 66554 
888877 7 6655 
88888776655 
98800777665 
88888776655 
88887776655 
88777766554 
77776665 5 54 
66666555444 
65555554443 
55544444333 
44444333322 
33333332222 
2222222221 1 


FIGURE 20 
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x 

J 

E 


X 

J 

Z-r , 



I,J 





-5 

23 

-9.315 


-5 

7 

-9.692 

-15 

23 

-3.505 


5 

a 

10.349 

6 

6 

10.372 


-3 

8 

-9. 830 

26 

6 

1 2. 253 


-23 

a 

-6.041 

6 

9 

10 . 542 


-13 

9 

-7.168 

16 

9 

14.105 


-26 

25 

-4.703 

-6 

17 

-9.0S1 


24 

24 

16.939 

-26 

17 

-4.261 


-20 

21 

-1 .045 

6 

25 

10.876 


27 

21 

15.160 

16 

25 

16.634 


-17 

4 

-7.739 

7 

5 

10.436 


27 

1 1 

13.459 

27 

5 

1 1 * 684 


-27 

15 

-5. 423 

7 

12 

10.976 


13 

27 

14.059 

17 

12 

15.906 


23 

9 

14.441 

7 

18 

1 1.318 


23 

20 

18.133 

27 

18 

15.083 


3 

22 

10.342 

7 

23 

1 l .285 


33 

22 

10.333 

17 

23 

17,775 


4 

4 

10.140 

8 

8 

10.867 


24 

4 

12.090 

18 

8 

14 > l 88 


-4 

7 

-9.783 

8 

17 

1 1 o 7 0 3 


4 

9 

10.276 

18 

17 

18.225 


-4 

12 

-9.633 

-8 

20 

-8.219 


-24 

12 

-4.541 

-18 

20 

-1 .396 


4 

15 

10.446 

-8 

24 

-8.356 


14 

23 

15.739 

-18 

24 

-2.058 


A. 

17 

10.483 

9 

4 

10.641 

*• 

14 

17 

15.739 

19 

15 

17,854 


-5 

7 

-19.692 

9 

19 

12.296 


-25 

7 

-7,082 

19 

19 

18.855 


-5 

14 

-9.402 

9 

27 

1 1.806 


-IS 

14 

-4.326 

-to 

5 

-9.051 


-5 

18 

-9.298 

24 

23 

17.187 


-25 

19 

^ * J-r- J 

-10 

26 

-7.559 





-20 

26 

-2.483 





-10 

8 

-8.578 





-20 

8 

-5.619 





-10 

16 

-7.303 





-20 

16 

-1.692 






TABLE 3 
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APPENDIX 1 


In this section we describe the experiment used to find 
the lower bound of the error detectabl* by human beings. From 
experimental psycology we take the next definition and assumption: 

The differential threshold i3 defined as the point c'~ which a difference 
can be detected 75% of the time. We assume that the data obtained from 
this experiment behaves like a normal distribution function with mean 
equal to zero. 

In this experiment each subject is going to look at 88 
targets that are divided in two equall groups. The first group 
is shown at 4ft. from the subject and the secong group is shown 
at 16ft. from the subject. Each target is a 8^" x 11" white 
cardboard infront of a white wall, this targets contain 5 or 6 
dots centered in the cardboard so they form a line or a semi- 
circle respectively. The dots are 1/L6" in diameter which is 
above the minimum size to be recognizable. The targets are 
shown one by one to each subject and there is no time limit to 
look at them. All the people that took this test claim to have 
normal sight (wearing glases if necessary). 

In the linear pattern one of the two center points is 
above the line that interpolates the rest, so that every subject 
dicides whether the left or right center point contains the error. 

The case when point with error is under the the interpolating 
line is not considered since it is assumed to be symmetrical 
And in the semicircular pattern all the 


to our case . 
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points belong to a semicircle (105°) enxcept: the center point which is 
either above or below the semicircle. In the cardboards the ordinates 
are 1" apart and the semicircle has 3" of radius. In Figure 21 we 
represent these patterns. 


e 


2 


T 4 - 


e 


1 


ft 






ft 


ft 


* / 


1 " 


/ ✓ 

1 " 


FIGURE 21 

The quantities e^ and e^ represent the distance between 

the points with error and the line or semicircle that interpolates the 
rest of the points. The distance e 2 may be positive or negative. 

From the preliminary tests we learned that was easier to detect 
an error in a line than In a circle, therefore the variations in the 
errors for the line are smaller than those of the semicircle. Let these 
variations be e^= 0,0, 1/100. 2/100, 3/100 and 4/100 of an inch and 

e 2 = -10/100, -8/100, -6/100, -4/100, -2/100, 0.0, 2/100, 4/100, 6/100, 

8/100, 10/100 and 12/100 of an inch. The zero errors are introduced 
to give consistency to the experiment, that is, we expect half of the 
people to give one answer and the other half to give the opposite 
answer in these cases. The targets will be ordered randomly. 



Porcentage of Porcentage of 

Correct Detection Correct Detection 
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The resultB for the line case are sumarizad in the next graphs. 



Porcentage of the error at 4 ft. 



Porcentage of the error at 16 ft. 


The quantity E represents the porcentage of the error, that 
is,' the quotient between the point error and the spacing of the points. 



Porcentage of Percentage of 

correct detection correct detection 
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To interpolate a normal density function with mean equal to zero 
we compute the variances, then we find the error that corresponds to the- 
differential threshold (i.e., 75# of correct detection). 



LINE 

LINE 

CIRCUS 

CIRCUS 

DISTANCE 

A 

16 

4 

16 

MEAN 

0 

0 

0 

0 

VARIANCE 

2.14 

2.98 

1.7 

1.6 

E 

1.0# 

1.1# 

2.8# 

2.7# 


This result means that the human eye is capable of recognizing 
errors with respect to the spacing of the points. Also it gives a lower 
bound below which it is impossible to decide whether one point is per- 
fectly smooth or not, and an upper bound above which we can recognize 
rough points. 


APPENDIX 2 


Here we present a table with the coeiicients of the first 
eight divided differences when the order is even. 


n-th 

difference 

k 

*" y i-4 

y i~3 

y i-2 

^1-1 

y i 

y i+l 

y i+2 

y i+3 

y i+4 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

2 

1 

0 

0 

0 

1 

-2 

1 

0 

0 

0 

4 

2 

0 

0 

1 

-4 

6 

-4 

1 

0 

0 

6 

3 

0 

1 

-6 

15 

0 

CM 

1 

15 

-6 

1 

0 

8 

• 

4 

• 

1 

• 

-8 

• 

28 

4 

-56 

4 

70 

9 

-56 

28 

9 

-8 

• 

1 

« 

n 

k=2n 

• 

« 

4 


(- 1 ) 

(2n) ! 

4 

• 

* 


(n-j) I(n4-j) ! 
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